AITopics | class-imbalanced data

ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences Ting-An Chen 1,2, De-Nian Y ang 2,3 Ming-Syan Chen 1,3 1

Neural Information Processing SystemsFeb-12-2026, 19:18:11 GMT

Despite a realistic problem, quantization on class-imbalanced data has not been studied.

artificial intelligence, machine learning, quantization, (16 more...)

Neural Information Processing Systems

Country:

Asia > Taiwan (0.05)
North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Neural Information Processing SystemsDec-24-2025, 19:08:24 GMT

Several techniques based on re-weighting and margin adjustment of loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin based techniques. Specifically, we examine the spectral density of Hessian of class-wise loss, through which we observe that the network weights converges to a saddle point in the loss landscapes of minority classes. Following this observation, we also find that optimization methods designed to escape from saddle points can be effectively used to improve generalization on minority classes. We further theoretically and empirically demonstrate that Sharpness-Aware Minimization (SAM), a recent technique that encourages convergence to a flat minima, can be effectively used to escape saddle points for minority classes.

effective generalization, escaping saddle point, minority class, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.83)

Add feedback

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

Neural Information Processing SystemsDec-24-2025, 16:21:31 GMT

We propose a novel unsupervised generative model that learns to disentangle object identity from other low-level aspects in class-imbalanced data. We first investigate the issues surrounding the assumptions about uniformity made by InfoGAN, and demonstrate its ineffectiveness to properly disentangle object identity in imbalanced data. Our key idea is to make the discovery of the discrete latent factor of variation invariant to identity-preserving transformations in real images, and use that as a signal to learn the appropriate latent distribution representing object identity. Experiments on both artificial (MNIST, 3D cars, 3D chairs, ShapeNet) and real-world (YouTube-Faces) imbalanced datasets demonstrate the effectiveness of our method in disentangling object identity as a latent factor of variation.

elastic-infogan, name change, unsupervised disentangled representation learning, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences Ting-An Chen 1,2, De-Nian Y ang 2,3 Ming-Syan Chen 1,3 1

Neural Information Processing SystemsAug-19-2025, 18:34:35 GMT

Despite a realistic problem, quantization on class-imbalanced data has not been studied.

artificial intelligence, machine learning, quantization, (16 more...)

Neural Information Processing Systems

Country:

Asia > Taiwan (0.05)
North America > Canada > Ontario > Toronto (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Neural Information Processing SystemsJan-17-2025, 18:42:49 GMT

Several techniques based on re-weighting and margin adjustment of loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin based techniques. Specifically, we examine the spectral density of Hessian of class-wise loss, through which we observe that the network weights converges to a saddle point in the loss landscapes of minority classes. Following this observation, we also find that optimization methods designed to escape from saddle points can be effectively used to improve generalization on minority classes. We further theoretically and empirically demonstrate that Sharpness-Aware Minimization (SAM), a recent technique that encourages convergence to a flat minima, can be effectively used to escape saddle points for minority classes.

effective generalization, escaping saddle point, minority class, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.87)

Add feedback

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

Neural Information Processing SystemsOct-11-2024, 10:29:54 GMT

We propose a novel unsupervised generative model that learns to disentangle object identity from other low-level aspects in class-imbalanced data. We first investigate the issues surrounding the assumptions about uniformity made by InfoGAN, and demonstrate its ineffectiveness to properly disentangle object identity in imbalanced data. Our key idea is to make the discovery of the discrete latent factor of variation invariant to identity-preserving transformations in real images, and use that as a signal to learn the appropriate latent distribution representing object identity. Experiments on both artificial (MNIST, 3D cars, 3D chairs, ShapeNet) and real-world (YouTube-Faces) imbalanced datasets demonstrate the effectiveness of our method in disentangling object identity as a latent factor of variation.

class-imbalanced data, elastic-infogan, unsupervised disentangled representation learning, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Rangwani, Harsh, Aithal, Sumukh K, Mishra, Mayank, Babu, R. Venkatesh

arXiv.org Artificial IntelligenceDec-28-2022

Real-world datasets exhibit imbalances of varying types and degrees. Several techniques based on re-weighting and margin adjustment of loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin-based techniques. Specifically, we examine the spectral density of Hessian of class-wise loss, through which we observe that the network weights converge to a saddle point in the loss landscapes of minority classes. Following this observation, we also find that optimization methods designed to escape from saddle points can be effectively used to improve generalization on minority classes. We further theoretically and empirically demonstrate that Sharpness-Aware Minimization (SAM), a recent technique that encourages convergence to a flat minima, can be effectively used to escape saddle points for minority classes. Using SAM results in a 6.2\% increase in accuracy on the minority classes over the state-of-the-art Vector Scaling Loss, leading to an overall average increase of 4\% across imbalanced datasets. The code is available at: https://github.com/val-iisc/Saddle-LongTail.

artificial intelligence, effective generalization, machine learning, (2 more...)

arXiv.org Artificial Intelligence

2212.13827

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.73)

Add feedback

The Hidden Uniform Cluster Prior in Self-Supervised Learning

Assran, Mahmoud, Balestriero, Randall, Duval, Quentin, Bordes, Florian, Misra, Ishan, Bojanowski, Piotr, Vincent, Pascal, Rabbat, Michael, Ballas, Nicolas

arXiv.org Artificial IntelligenceOct-13-2022

A successful paradigm in representation learning is to perform self-supervised pretraining using tasks based on mini-batch statistics (e.g., SimCLR, VICReg, SwAV, MSN). We show that in the formulation of all these methods is an overlooked prior to learn features that enable uniform clustering of the data. While this prior has led to remarkably semantic representations when pretraining on class-balanced data, such as ImageNet, we demonstrate that it can hamper performance when pretraining on class-imbalanced data. By moving away from conventional uniformity priors and instead preferring power-law distributed feature clusters, we show that one can improve the quality of the learned representations on real-world class-imbalanced datasets. To demonstrate this, we develop an extension of the Masked Siamese Networks (MSN) method to support the use of arbitrary features priors.

artificial intelligence, machine learning, representation, (16 more...)

arXiv.org Artificial Intelligence

2210.07277

Country:

North America > Canada > Quebec > Montreal (0.14)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

Population structure-learned classifier for high-dimension low-sample-size class-imbalanced problem

Shen, Liran, Er, Meng Joo, Yin, Qingbo

arXiv.org Machine LearningSep-10-2020

The Classification on high-dimension low-sample-size data (HDLSS) is a challenging problem and it is common to have class-imbalanced data in most application fields. We term this as Imbalanced HDLSS (IHDLSS). Recent theoretical results reveal that the classification criterion and tolerance similarity are crucial to HDLSS, which emphasizes the maximization of within-class variance on the premise of class separability. Based on this idea, a novel linear binary classifier, termed Population Structure-learned Classifier (PSC), is proposed. The proposed PSC can obtain better generalization performance on IHDLSS by maximizing the sum of inter-class scatter matrix and intra-class scatter matrix on the premise of class separability and assigning different intercept values to majority and minority classes. The salient features of the proposed approach are: (1) It works well on IHDLSS; (2) The inverse of high dimensional matrix can be solved in low dimensional space; (3) It is self-adaptive in determining the intercept term for each class; (4) It has the same computational complexity as the SVM. A series of evaluations are conducted on one simulated data set and eight real-world benchmark data sets on IHDLSS on gene analysis. Experimental results demonstrate that the PSC is superior to the state-of-art methods in IHDLSS.

artificial intelligence, boxplot, machine learning, (15 more...)

arXiv.org Machine Learning

2009.04722

Country:

Asia > China > Liaoning Province > Dalian (0.05)
North America > United States > California > Santa Clara County > Santa Clara (0.04)
North America > United States > North Carolina (0.04)
(5 more...)

Genre: Research Report > New Finding (0.48)

Industry:

Health & Medicine > Therapeutic Area > Hematology (0.68)
Health & Medicine > Therapeutic Area > Oncology > Leukemia (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning

Ye, Han-Jia, Chen, Hong-You, Zhan, De-Chuan, Chao, Wei-Lun

arXiv.org Machine LearningJan-5-2020

In practice, however, we frequently encounter training data with a class-imbalanced distribution . For example, modern real-world large-scale datasets often have the so-called long-tailed distribution: a few major classes claim most of the instances, while most of the other minor classes are represented by relatively fewer instances [16, 31, 38, 50, 51, 61]. Classifiers trained with this kind of datasets using conventional strategies (e.g., mini-batch SGD on uniformly sampled instances) have been found to perform poorly on minor classes [3, 19, 40, 52], which is particularly unfavorable if we evaluate the classifiers with class-balanced test data or average per-class accuracy. One common explanation to the poor performance is the Figure 1: Over-fitting to minor classes and feature deviation: (top-left) the number of training (red) and test (blue) instances per class of an imbalanced CIFAR-10 [8, 32]; (top-right) the training and test set accuracy per class using a ResNet [20]; (bottom) the t-SNE [41] plot of the training (circle) and test (cross) features before the last linear classifier layer. We see a trend of over-fitting to minor classes, which results from the feature deviation of training and test instances (see the magenta and red minor classes).

accuracy, classifier, minor class, (15 more...)

arXiv.org Machine Learning

2001.01385

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Ohio (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
Europe > Poland (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.83)

Add feedback

Filters

Collaborating Authors

class-imbalanced data

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences Ting-An Chen 1,2, De-Nian Y ang 2,3 Ming-Syan Chen 1,3 1

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

ClimbQ: Class Imbalanced Quantization Enabling Robustness on Efficient Inferences Ting-An Chen 1,2, De-Nian Y ang 2,3 Ming-Syan Chen 1,3 1

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Class-Imbalanced Data

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

The Hidden Uniform Cluster Prior in Self-Supervised Learning

Population structure-learned classifier for high-dimension low-sample-size class-imbalanced problem

Identifying and Compensating for Feature Deviation in Imbalanced Deep Learning